Overview

Dataset statistics

Number of variables: 21
Number of observations: 3333
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 710.1 KiB
Average record size in memory: 218.2 B

Variable types

NUM: 16
BOOL: 3
CAT: 2

Reproduction

Analysis started: 2020-07-27 13:13:46.939442
Analysis finished: 2020-07-27 13:14:32.514025
Duration: 45.57 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

State has a high cardinality: 51 distinct values (High cardinality)
TotalMorCharge is highly correlated with TotalMorMin (High correlation)
TotalMorMin is highly correlated with TotalMorCharge (High correlation)
TotalEveCharge is highly correlated with TotalEveMin (High correlation)
TotalEveMin is highly correlated with TotalEveCharge (High correlation)
TotalNightCharge is highly correlated with TotalNightMin (High correlation)
TotalNightMin is highly correlated with TotalNightCharge (High correlation)
TotalIntCharge is highly correlated with TotalIntMinutes (High correlation)
TotalIntMinutes is highly correlated with TotalIntCharge (High correlation)
PhoneNumber has unique values (Unique)
NumEmailMessages has 2411 (72.3%) zeros (Zeros)
CustomerServiceCalls has 697 (20.9%) zeros (Zeros)
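
One plausible reading of the high-correlation warnings is that each charge column is simply a per-minute rate applied to the matching minutes column. The report does not say this, so the sketch below is only a quick check under that assumption, using five (TotalMorMin, TotalMorCharge) pairs copied from the First rows sample. The variable names are illustrative.

```python
# (TotalMorMin, TotalMorCharge) pairs copied from the first five sample rows.
rows = [
    (265.1, 45.07),
    (161.6, 27.47),
    (243.4, 41.38),
    (299.4, 50.90),
    (166.7, 28.34),
]

# If charge is a fixed rate times minutes, the ratio should be nearly constant.
ratios = [round(charge / minutes, 3) for minutes, charge in rows]
print(ratios)
```

If the ratios agree across rows, one column of each minutes/charge pair carries almost no extra information, which is why such pairs are flagged.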

Variables

State
Categorical

HIGH CARDINALITY

Distinct count: 51
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 26.0 KiB

Common values
Value | Count | Frequency (%)
WV | 106 | 3.2%
MN | 84 | 2.5%
NY | 83 | 2.5%
AL | 80 | 2.4%
WI | 78 | 2.3%
OR | 78 | 2.3%
OH | 78 | 2.3%
WY | 77 | 2.3%
VA | 77 | 2.3%
CT | 74 | 2.2%
Other values (41) | 2518 | 75.5%

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

AccountLength
Real number (ℝ≥0)

Distinct count: 212
Unique (%): 6.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 101.06480648064806
Minimum: 1
Maximum: 243
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 35
Q1: 74
Median: 101
Q3: 127
95-th percentile: 167
Maximum: 243
Range: 242
Interquartile range (IQR): 53

Descriptive statistics

Standard deviation: 39.82210593
Coefficient of variation (CV): 0.3940254508
Kurtosis: -0.1078359806
Mean: 101.0648065
Median Absolute Deviation (MAD): 27
Skewness: 0.09660629423
Sum: 336849
Variance: 1585.800121
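
Several of the figures above are derived from others in the same block. As a small sanity check (a sketch only, using the AccountLength values as printed in this report and the usual textbook definitions), the relationships can be recomputed like this:

```python
# Values copied from the AccountLength section of the report.
mean = 101.0648065
std = 39.82210593
minimum, maximum = 1, 243
q1, q3 = 74, 127

cv = std / mean                  # coefficient of variation = std / mean
variance = std ** 2              # variance = squared standard deviation
value_range = maximum - minimum  # range = maximum - minimum
iqr = q3 - q1                    # interquartile range = Q3 - Q1

print(round(cv, 6), round(variance, 3), value_range, iqr)
```

The recomputed values agree with the reported CV, variance, range and IQR up to rounding of the printed inputs.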
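Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
105 | 43 | 1.3%
87 | 42 | 1.3%
93 | 40 | 1.2%
101 | 40 | 1.2%
90 | 39 | 1.2%
86 | 38 | 1.1%
95 | 38 | 1.1%
116 | 37 | 1.1%
100 | 37 | 1.1%
112 | 36 | 1.1%
Other values (202) | 2943 | 88.3%

Minimum 5 values
Value | Count | Frequency (%)
1 | 8 | 0.2%
2 | 1 | < 0.1%
3 | 5 | 0.2%
4 | 1 | < 0.1%
5 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
243 | 1 | < 0.1%
232 | 1 | < 0.1%
225 | 2 | 0.1%
224 | 2 | 0.1%
221 | 1 | < 0.1%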

AreaCode
Categorical

Distinct count: 3
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.0 KiB

Common values
Value | Count | Frequency (%)
415 | 1655 | 49.7%
510 | 840 | 25.2%
408 | 838 | 25.1%

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

PhoneNumber
Real number (ℝ≥0)

UNIQUE

Distinct count: 3333
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3746291.289528953
Minimum: 3271058
Maximum: 4229964
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 3271058
5-th percentile: 3324290.2
Q1: 3508680
Median: 3748187
Q3: 3985970
95-th percentile: 4174024.6
Maximum: 4229964
Range: 958906
Interquartile range (IQR): 477290

Descriptive statistics

Standard deviation: 274662.5738
Coefficient of variation (CV): 0.07331586161
Kurtosis: -1.224780245
Mean: 3746291.29
Median Absolute Deviation (MAD): 239266
Skewness: 0.009732142906
Sum: 1.248638887e+10
Variance: 7.543952942e+10

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
3665918 | 1 | < 0.1%
3818593 | 1 | < 0.1%
3359501 | 1 | < 0.1%
4123404 | 1 | < 0.1%
4188850 | 1 | < 0.1%
3868984 | 1 | < 0.1%
3734274 | 1 | < 0.1%
3384065 | 1 | < 0.1%
3569187 | 1 | < 0.1%
3328764 | 1 | < 0.1%
Other values (3323) | 3323 | 99.7%

Minimum 5 values
Value | Count | Frequency (%)
3271058 | 1 | < 0.1%
3271319 | 1 | < 0.1%
3273053 | 1 | < 0.1%
3273587 | 1 | < 0.1%
3273850 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
4229964 | 1 | < 0.1%
4228344 | 1 | < 0.1%
4228333 | 1 | < 0.1%
4228268 | 1 | < 0.1%
4227728 | 1 | < 0.1%
 
InternationalPlan?
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.0 KiB

Common values
Value | Count | Frequency (%)
No | 3010 | 90.3%
Yes | 323 | 9.7%

VoiceMailPlan?
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 26.0 KiB

Common values
Value | Count | Frequency (%)
No | 2411 | 72.3%
Yes | 922 | 27.7%

NumEmailMessages
Real number (ℝ≥0)

ZEROS

Distinct count: 46
Unique (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.099009900990099
Minimum: 0
Maximum: 51
Zeros: 2411
Zeros (%): 72.3%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 20
95-th percentile: 36
Maximum: 51
Range: 51
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 13.68836537
Coefficient of variation (CV): 1.690128243
Kurtosis: -0.05112853879
Mean: 8.099009901
Median Absolute Deviation (MAD): 0
Skewness: 1.264823634
Sum: 26994
Variance: 187.3713466

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
0 | 2411 | 72.3%
31 | 60 | 1.8%
29 | 53 | 1.6%
28 | 51 | 1.5%
33 | 46 | 1.4%
27 | 44 | 1.3%
30 | 44 | 1.3%
24 | 42 | 1.3%
26 | 41 | 1.2%
32 | 41 | 1.2%
Other values (36) | 500 | 15.0%

Minimum 5 values
Value | Count | Frequency (%)
0 | 2411 | 72.3%
4 | 1 | < 0.1%
8 | 2 | 0.1%
9 | 2 | 0.1%
10 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
51 | 1 | < 0.1%
50 | 2 | 0.1%
49 | 1 | < 0.1%
48 | 2 | 0.1%
47 | 3 | 0.1%

TotalMorMin
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 1681
Unique (%): 50.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 179.8114731473147
Minimum: 0.0
Maximum: 350.8
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 90.4
Q1: 143.7
Median: 179.3
Q3: 216.4
95-th percentile: 270.58
Maximum: 350.8
Range: 350.8
Interquartile range (IQR): 72.7

Descriptive statistics

Standard deviation: 54.33331286
Coefficient of variation (CV): 0.3021682205
Kurtosis: -0.01221507587
Mean: 179.8114731
Median Absolute Deviation (MAD): 36.3
Skewness: -0.02543093921
Sum: 599311.64
Variance: 2952.108886

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
154 | 8 | 0.2%
159.5 | 8 | 0.2%
174.5 | 8 | 0.2%
175.4 | 7 | 0.2%
162.3 | 7 | 0.2%
183.4 | 7 | 0.2%
142.3 | 6 | 0.2%
181.5 | 6 | 0.2%
198.4 | 6 | 0.2%
194.8 | 6 | 0.2%
Other values (1671) | 3264 | 97.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 2 | 0.1%
2.6 | 1 | < 0.1%
7.8 | 1 | < 0.1%
7.9 | 1 | < 0.1%
12.5 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
350.8 | 1 | < 0.1%
346.8 | 1 | < 0.1%
345.3 | 1 | < 0.1%
337.4 | 1 | < 0.1%
335.5 | 1 | < 0.1%

TotalMorCalls
Real number (ℝ≥0)

Distinct count: 119
Unique (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.43564356435644
Minimum: 0
Maximum: 165
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 67
Q1: 87
Median: 101
Q3: 114
95-th percentile: 133
Maximum: 165
Range: 165
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 20.06908421
Coefficient of variation (CV): 0.1998203376
Kurtosis: 0.2431815246
Mean: 100.4356436
Median Absolute Deviation (MAD): 13
Skewness: -0.111786639
Sum: 334752
Variance: 402.7681409

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
102 | 78 | 2.3%
105 | 75 | 2.3%
107 | 69 | 2.1%
95 | 69 | 2.1%
104 | 68 | 2.0%
108 | 67 | 2.0%
97 | 67 | 2.0%
110 | 66 | 2.0%
106 | 66 | 2.0%
88 | 66 | 2.0%
Other values (109) | 2642 | 79.3%

Minimum 5 values
Value | Count | Frequency (%)
0 | 2 | 0.1%
30 | 1 | < 0.1%
35 | 1 | < 0.1%
36 | 1 | < 0.1%
40 | 2 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
165 | 1 | < 0.1%
163 | 1 | < 0.1%
160 | 1 | < 0.1%
158 | 3 | 0.1%
157 | 1 | < 0.1%

TotalMorCharge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 1667
Unique (%): 50.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30.562307230723075
Minimum: 0.0
Maximum: 59.64
Zeros: 2
Zeros (%): 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 15.288
Q1: 24.43
Median: 30.5
Q3: 36.79
95-th percentile: 46.028
Maximum: 59.64
Range: 59.64
Interquartile range (IQR): 12.36

Descriptive statistics

Standard deviation: 9.259434554
Coefficient of variation (CV): 0.3029690947
Kurtosis: -0.01981178724
Mean: 30.56230723
Median Absolute Deviation (MAD): 6.17
Skewness: -0.02908326834
Sum: 101864.17
Variance: 85.73712826

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
27.12 | 8 | 0.2%
26.18 | 8 | 0.2%
29.67 | 8 | 0.2%
31.18 | 7 | 0.2%
27.59 | 7 | 0.2%
29.82 | 7 | 0.2%
24.19 | 6 | 0.2%
28.66 | 6 | 0.2%
36.72 | 6 | 0.2%
26.1 | 6 | 0.2%
Other values (1657) | 3264 | 97.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 2 | 0.1%
0.44 | 1 | < 0.1%
1.33 | 1 | < 0.1%
1.34 | 1 | < 0.1%
2.13 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
59.64 | 1 | < 0.1%
58.96 | 1 | < 0.1%
58.7 | 1 | < 0.1%
57.36 | 1 | < 0.1%
57.04 | 1 | < 0.1%

TotalEveMin
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 1617
Unique (%): 48.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200.95359735973597
Minimum: 0.0
Maximum: 363.7
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 118.8
Q1: 166.7
Median: 201.2
Q3: 235.3
95-th percentile: 284.3
Maximum: 363.7
Range: 363.7
Interquartile range (IQR): 68.6

Descriptive statistics

Standard deviation: 50.68826178
Coefficient of variation (CV): 0.2522386384
Kurtosis: 0.0305952821
Mean: 200.9535974
Median Absolute Deviation (MAD): 34.2
Skewness: -0.0226241651
Sum: 669778.34
Variance: 2569.299882

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
169.9 | 9 | 0.3%
220.6 | 7 | 0.2%
230.9 | 7 | 0.2%
180.5 | 7 | 0.2%
201 | 7 | 0.2%
161.7 | 7 | 0.2%
167.2 | 7 | 0.2%
195.5 | 7 | 0.2%
209.4 | 7 | 0.2%
203.8 | 6 | 0.2%
Other values (1607) | 3262 | 97.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 1 | < 0.1%
31.2 | 1 | < 0.1%
42.2 | 1 | < 0.1%
42.5 | 1 | < 0.1%
43.9 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
363.7 | 1 | < 0.1%
361.8 | 1 | < 0.1%
354.2 | 1 | < 0.1%
351.6 | 1 | < 0.1%
350.9 | 1 | < 0.1%

TotalEveCalls
Real number (ℝ≥0)

Distinct count: 123
Unique (%): 3.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.11431143114311
Minimum: 0
Maximum: 170
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 67
Q1: 87
Median: 100
Q3: 114
95-th percentile: 133
Maximum: 170
Range: 170
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 19.92262529
Coefficient of variation (CV): 0.1989987746
Kurtosis: 0.206156468
Mean: 100.1143114
Median Absolute Deviation (MAD): 13
Skewness: -0.05556313904
Sum: 333681
Variance: 396.9109986

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
105 | 80 | 2.4%
94 | 79 | 2.4%
108 | 71 | 2.1%
97 | 70 | 2.1%
102 | 70 | 2.1%
88 | 69 | 2.1%
101 | 68 | 2.0%
109 | 67 | 2.0%
98 | 66 | 2.0%
111 | 65 | 2.0%
Other values (113) | 2628 | 78.8%

Minimum 5 values
Value | Count | Frequency (%)
0 | 1 | < 0.1%
12 | 1 | < 0.1%
36 | 1 | < 0.1%
37 | 1 | < 0.1%
42 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
170 | 1 | < 0.1%
168 | 1 | < 0.1%
164 | 1 | < 0.1%
159 | 1 | < 0.1%
157 | 1 | < 0.1%

TotalEveCharge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 1440
Unique (%): 43.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.083540354035403
Minimum: 0.0
Maximum: 30.91
Zeros: 1
Zeros (%): < 0.1%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 10.1
Q1: 14.16
Median: 17.12
Q3: 20
95-th percentile: 24.17
Maximum: 30.91
Range: 30.91
Interquartile range (IQR): 5.84

Descriptive statistics

Standard deviation: 4.310667643
Coefficient of variation (CV): 0.2523287067
Kurtosis: 0.02548740481
Mean: 17.08354035
Median Absolute Deviation (MAD): 2.92
Skewness: -0.02385798901
Sum: 56939.44
Variance: 18.58185553

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
14.25 | 11 | 0.3%
16.12 | 11 | 0.3%
15.9 | 10 | 0.3%
18.62 | 9 | 0.3%
14.44 | 9 | 0.3%
17.09 | 9 | 0.3%
17.99 | 9 | 0.3%
18.79 | 8 | 0.2%
16.63 | 8 | 0.2%
17.43 | 8 | 0.2%
Other values (1430) | 3241 | 97.2%

Minimum 5 values
Value | Count | Frequency (%)
0 | 1 | < 0.1%
2.65 | 1 | < 0.1%
3.59 | 1 | < 0.1%
3.61 | 1 | < 0.1%
3.73 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
30.91 | 1 | < 0.1%
30.75 | 1 | < 0.1%
30.11 | 1 | < 0.1%
29.89 | 1 | < 0.1%
29.83 | 1 | < 0.1%

TotalNightMin
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 1598
Unique (%): 47.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200.8676807680768
Minimum: 23.2
Maximum: 395.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 23.2
5-th percentile: 118.18
Q1: 167.1
Median: 201.1
Q3: 235.3
95-th percentile: 282.84
Maximum: 395
Range: 371.8
Interquartile range (IQR): 68.2

Descriptive statistics

Standard deviation: 50.5415288
Coefficient of variation (CV): 0.2516160321
Kurtosis: 0.09150042041
Mean: 200.8676808
Median Absolute Deviation (MAD): 34.2
Skewness: 0.009892727055
Sum: 669491.98
Variance: 2554.446134

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
188.2 | 8 | 0.2%
191.4 | 8 | 0.2%
197.4 | 8 | 0.2%
214.6 | 8 | 0.2%
221.6 | 8 | 0.2%
210 | 8 | 0.2%
194.3 | 7 | 0.2%
193.6 | 7 | 0.2%
214.7 | 7 | 0.2%
206.1 | 7 | 0.2%
Other values (1588) | 3257 | 97.7%

Minimum 5 values
Value | Count | Frequency (%)
23.2 | 1 | < 0.1%
43.7 | 1 | < 0.1%
45 | 1 | < 0.1%
47.4 | 1 | < 0.1%
50.1 | 2 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
395 | 1 | < 0.1%
381.9 | 1 | < 0.1%
377.5 | 1 | < 0.1%
367.7 | 1 | < 0.1%
364.9 | 1 | < 0.1%

TotalNightCalls
Real number (ℝ≥0)

Distinct count: 120
Unique (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.10771077107711
Minimum: 33
Maximum: 175
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 33
5-th percentile: 68
Q1: 87
Median: 100
Q3: 113
95-th percentile: 132
Maximum: 175
Range: 142
Interquartile range (IQR): 26

Descriptive statistics

Standard deviation: 19.56860935
Coefficient of variation (CV): 0.1954755452
Kurtosis: -0.07201957894
Mean: 100.1077108
Median Absolute Deviation (MAD): 13
Skewness: 0.03249957015
Sum: 333659
Variance: 382.9304717

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
105 | 84 | 2.5%
104 | 78 | 2.3%
91 | 76 | 2.3%
102 | 72 | 2.2%
100 | 69 | 2.1%
106 | 69 | 2.1%
98 | 67 | 2.0%
94 | 66 | 2.0%
103 | 65 | 2.0%
108 | 64 | 1.9%
Other values (110) | 2623 | 78.7%

Minimum 5 values
Value | Count | Frequency (%)
33 | 1 | < 0.1%
36 | 1 | < 0.1%
38 | 1 | < 0.1%
42 | 2 | 0.1%
44 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
175 | 1 | < 0.1%
166 | 1 | < 0.1%
164 | 1 | < 0.1%
158 | 1 | < 0.1%
157 | 2 | 0.1%

TotalNightCharge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 933
Unique (%): 28.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.03932493249325
Minimum: 1.04
Maximum: 17.77
Zeros: 0
Zeros (%): 0.0%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 1.04
5-th percentile: 5.316
Q1: 7.52
Median: 9.05
Q3: 10.59
95-th percentile: 12.73
Maximum: 17.77
Range: 16.73
Interquartile range (IQR): 3.07

Descriptive statistics

Standard deviation: 2.275872838
Coefficient of variation (CV): 0.2517746463
Kurtosis: 0.08566317984
Mean: 9.039324932
Median Absolute Deviation (MAD): 1.54
Skewness: 0.008886236769
Sum: 30128.07
Variance: 5.179597173

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
9.66 | 15 | 0.5%
9.45 | 15 | 0.5%
8.88 | 14 | 0.4%
8.47 | 14 | 0.4%
7.69 | 13 | 0.4%
8.64 | 12 | 0.4%
9.14 | 11 | 0.3%
10.35 | 11 | 0.3%
10.8 | 11 | 0.3%
9.32 | 11 | 0.3%
Other values (923) | 3206 | 96.2%

Minimum 5 values
Value | Count | Frequency (%)
1.04 | 1 | < 0.1%
1.97 | 1 | < 0.1%
2.03 | 1 | < 0.1%
2.13 | 1 | < 0.1%
2.25 | 2 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
17.77 | 1 | < 0.1%
17.19 | 1 | < 0.1%
16.99 | 1 | < 0.1%
16.55 | 1 | < 0.1%
16.42 | 1 | < 0.1%

TotalIntMinutes
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 162
Unique (%): 4.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.237293729372938
Minimum: 0.0
Maximum: 20.0
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 5.7
Q1: 8.5
Median: 10.3
Q3: 12.1
95-th percentile: 14.7
Maximum: 20
Range: 20
Interquartile range (IQR): 3.6

Descriptive statistics

Standard deviation: 2.791839548
Coefficient of variation (CV): 0.2727126546
Kurtosis: 0.6091847602
Mean: 10.23729373
Median Absolute Deviation (MAD): 1.8
Skewness: -0.2451359395
Sum: 34120.9
Variance: 7.794368064

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
10 | 62 | 1.9%
11.3 | 59 | 1.8%
9.8 | 56 | 1.7%
10.9 | 56 | 1.7%
10.1 | 53 | 1.6%
10.2 | 53 | 1.6%
10.6 | 53 | 1.6%
11.1 | 52 | 1.6%
11 | 52 | 1.6%
9.7 | 51 | 1.5%
Other values (152) | 2786 | 83.6%

Minimum 5 values
Value | Count | Frequency (%)
0 | 18 | 0.5%
1.1 | 1 | < 0.1%
1.3 | 1 | < 0.1%
2 | 2 | 0.1%
2.1 | 2 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
20 | 1 | < 0.1%
18.9 | 1 | < 0.1%
18.4 | 1 | < 0.1%
18.3 | 1 | < 0.1%
18.2 | 2 | 0.1%

TotalIntCalls
Real number (ℝ≥0)

Distinct count: 21
Unique (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.4794479447944795
Minimum: 0
Maximum: 20
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
Median: 4
Q3: 6
95-th percentile: 9
Maximum: 20
Range: 20
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.461214271
Coefficient of variation (CV): 0.5494458917
Kurtosis: 3.083588982
Mean: 4.479447945
Median Absolute Deviation (MAD): 1
Skewness: 1.321478166
Sum: 14930
Variance: 6.057575686

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
3 | 668 | 20.0%
4 | 619 | 18.6%
2 | 489 | 14.7%
5 | 472 | 14.2%
6 | 336 | 10.1%
7 | 218 | 6.5%
1 | 160 | 4.8%
8 | 116 | 3.5%
9 | 109 | 3.3%
10 | 50 | 1.5%
Other values (11) | 96 | 2.9%

Minimum 5 values
Value | Count | Frequency (%)
0 | 18 | 0.5%
1 | 160 | 4.8%
2 | 489 | 14.7%
3 | 668 | 20.0%
4 | 619 | 18.6%

Maximum 5 values
Value | Count | Frequency (%)
20 | 1 | < 0.1%
19 | 1 | < 0.1%
18 | 3 | 0.1%
17 | 1 | < 0.1%
16 | 2 | 0.1%

TotalIntCharge
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 162
Unique (%): 4.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.7645814581458144
Minimum: 0.0
Maximum: 5.4
Zeros: 18
Zeros (%): 0.5%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1.54
Q1: 2.3
Median: 2.78
Q3: 3.27
95-th percentile: 3.97
Maximum: 5.4
Range: 5.4
Interquartile range (IQR): 0.97

Descriptive statistics

Standard deviation: 0.7537726127
Coefficient of variation (CV): 0.2726534284
Kurtosis: 0.6096104298
Mean: 2.764581458
Median Absolute Deviation (MAD): 0.48
Skewness: -0.2452865083
Sum: 9214.35
Variance: 0.5681731516

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
2.7 | 62 | 1.9%
3.05 | 59 | 1.8%
2.65 | 56 | 1.7%
2.94 | 56 | 1.7%
2.73 | 53 | 1.6%
2.86 | 53 | 1.6%
2.75 | 53 | 1.6%
3 | 52 | 1.6%
2.97 | 52 | 1.6%
2.62 | 51 | 1.5%
Other values (152) | 2786 | 83.6%

Minimum 5 values
Value | Count | Frequency (%)
0 | 18 | 0.5%
0.3 | 1 | < 0.1%
0.35 | 1 | < 0.1%
0.54 | 2 | 0.1%
0.57 | 2 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
5.4 | 1 | < 0.1%
5.1 | 1 | < 0.1%
4.97 | 1 | < 0.1%
4.94 | 1 | < 0.1%
4.91 | 2 | 0.1%

CustomerServiceCalls
Real number (ℝ≥0)

ZEROS

Distinct count: 10
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.5628562856285628
Minimum: 0
Maximum: 9
Zeros: 697
Zeros (%): 20.9%
Memory size: 26.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 1
Q3: 2
95-th percentile: 4
Maximum: 9
Range: 9
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.315491045
Coefficient of variation (CV): 0.8417223368
Kurtosis: 1.730913655
Mean: 1.562856286
Median Absolute Deviation (MAD): 1
Skewness: 1.091359482
Sum: 5209
Variance: 1.730516689

Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
1 | 1181 | 35.4%
2 | 759 | 22.8%
0 | 697 | 20.9%
3 | 429 | 12.9%
4 | 166 | 5.0%
5 | 66 | 2.0%
6 | 22 | 0.7%
7 | 9 | 0.3%
9 | 2 | 0.1%
8 | 2 | 0.1%

Minimum 5 values
Value | Count | Frequency (%)
0 | 697 | 20.9%
1 | 1181 | 35.4%
2 | 759 | 22.8%
3 | 429 | 12.9%
4 | 166 | 5.0%

Maximum 5 values
Value | Count | Frequency (%)
9 | 2 | 0.1%
8 | 2 | 0.1%
7 | 9 | 0.3%
6 | 22 | 0.7%
5 | 66 | 2.0%

Churn?
Boolean

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Common values
Value | Count | Frequency (%)
False | 2850 | 85.5%
True | 483 | 14.5%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that, for a linear relationship, the steepness of the line does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
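
The calculation just described can be sketched in plain Python. This is a minimal illustration, not the profiler's own code: `pearson_r` is a hypothetical helper name, and the inputs are TotalMorMin and TotalMorCharge values copied from the first five sample rows of this report.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations.

    The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    so plain sums of deviations are enough.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# TotalMorMin and TotalMorCharge from the first five sample rows.
minutes = [265.1, 161.6, 243.4, 299.4, 166.7]
charge = [45.07, 27.47, 41.38, 50.90, 28.34]

r = pearson_r(minutes, charge)
# Shifting and rescaling one variable leaves r unchanged.
r_shifted = pearson_r([2 * m + 3 for m in minutes], charge)
print(r, r_shifted)
```

On these rows r comes out very close to 1, in line with the high-correlation warning for this pair of columns.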

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
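
As a rough sketch of that recipe (rank both variables, then apply the Pearson formula to the ranks), the following uses hypothetical helper names and made-up toy data. Tied values are given the average of their rank positions, which is a common convention but an assumption here.

```python
from math import sqrt

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks

def pearson_r(xs, ys):
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mean_x) ** 2 for x in xs)
                      * sum((y - mean_y) ** 2 for y in ys))

def spearman_rho(xs, ys):
    """Pearson correlation of the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

Here ρ reaches 1 because the relationship is perfectly monotonic, while r stays below 1 because it is not linear.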

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
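
The pair-counting description corresponds to the tie-free form often called tau-a. The sketch below implements exactly that form on toy data; `kendall_tau_a` is a hypothetical name, and statistical libraries commonly report a tie-adjusted variant instead, so values can differ from the report when ties are present.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    pairs = list(combinations(range(len(xs)), 2))
    concordant = 0
    discordant = 0
    for i, j in pairs:
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
        # direction == 0 means a tie, counted as neither
    return (concordant - discordant) / len(pairs)

# Toy data: one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, giving (5 - 1) / 6.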

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
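
For illustration, a bias-corrected Cramér's V in the style of Bergsma's 2013 correction can be sketched as follows. This is an assumption about the general form of the correction, not a copy of the profiler's implementation; `cramers_v_corrected` is a hypothetical name and the data are toy labels.

```python
from collections import Counter
from math import sqrt

def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V for two sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Subtract the expected bias of phi2 and shrink the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    if denom <= 0:
        return 0.0  # a constant variable carries no association
    return sqrt(phi2_corr / denom)

# Toy data: perfectly associated labels, then independent labels.
v_perfect = cramers_v_corrected(["a"] * 50 + ["b"] * 50, ["x"] * 50 + ["y"] * 50)
v_independent = cramers_v_corrected(["a", "a", "b", "b"] * 25, ["x", "y", "x", "y"] * 25)
print(v_perfect, v_independent)
```

The two toy cases sit at the ends of the scale: perfectly associated labels give a value of about 1, and a balanced independent table gives 0.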

Missing values

Sample

First rows

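(index) | State | AccountLength | AreaCode | PhoneNumber | InternationalPlan? | VoiceMailPlan? | NumEmailMessages | TotalMorMin | TotalMorCalls | TotalMorCharge | TotalEveMin | TotalEveCalls | TotalEveCharge | TotalNightMin | TotalNightCalls | TotalNightCharge | TotalIntMinutes | TotalIntCalls | TotalIntCharge | CustomerServiceCalls | Churn?
0 | KS | 128 | 415 | 3824657 | No | Yes | 25 | 265.1 | 110 | 45.07 | 197.4 | 99 | 16.78 | 244.7 | 91 | 11.01 | 10.0 | 3 | 2.70 | 1 | False
1 | OH | 107 | 415 | 3717191 | No | Yes | 26 | 161.6 | 123 | 27.47 | 195.5 | 103 | 16.62 | 254.4 | 103 | 11.45 | 13.7 | 3 | 3.70 | 1 | False
2 | NJ | 137 | 415 | 3581921 | No | No | 0 | 243.4 | 114 | 41.38 | 121.2 | 110 | 10.30 | 162.6 | 104 | 7.32 | 12.2 | 5 | 3.29 | 0 | False
3 | OH | 84 | 408 | 3759999 | Yes | No | 0 | 299.4 | 71 | 50.90 | 61.9 | 88 | 5.26 | 196.9 | 89 | 8.86 | 6.6 | 7 | 1.78 | 2 | False
4 | OK | 75 | 415 | 3306626 | Yes | No | 0 | 166.7 | 113 | 28.34 | 148.3 | 122 | 12.61 | 186.9 | 121 | 8.41 | 10.1 | 3 | 2.73 | 3 | False
5 | AL | 118 | 510 | 3918027 | Yes | No | 0 | 223.4 | 98 | 37.98 | 220.6 | 101 | 18.75 | 203.9 | 118 | 9.18 | 6.3 | 6 | 1.70 | 0 | False
6 | MA | 121 | 510 | 3559993 | No | Yes | 24 | 218.2 | 88 | 37.09 | 348.5 | 108 | 29.62 | 212.6 | 118 | 9.57 | 7.5 | 7 | 2.03 | 3 | False
7 | MO | 147 | 415 | 3299001 | Yes | No | 0 | 157.0 | 79 | 26.69 | 103.1 | 94 | 8.76 | 211.8 | 96 | 9.53 | 7.1 | 6 | 1.92 | 0 | False
8 | LA | 117 | 408 | 3354719 | No | No | 0 | 184.5 | 97 | 31.37 | 351.6 | 80 | 29.89 | 215.8 | 90 | 9.71 | 8.7 | 4 | 2.35 | 1 | False
9 | WV | 141 | 415 | 3308173 | Yes | Yes | 37 | 258.6 | 84 | 43.96 | 222.0 | 111 | 18.87 | 326.4 | 97 | 14.69 | 11.2 | 5 | 3.02 | 0 | False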

Last rows

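(index) | State | AccountLength | AreaCode | PhoneNumber | InternationalPlan? | VoiceMailPlan? | NumEmailMessages | TotalMorMin | TotalMorCalls | TotalMorCharge | TotalEveMin | TotalEveCalls | TotalEveCharge | TotalNightMin | TotalNightCalls | TotalNightCharge | TotalIntMinutes | TotalIntCalls | TotalIntCharge | CustomerServiceCalls | Churn?
3323 | IN | 117 | 415 | 3625899 | No | No | 0 | 118.4 | 126 | 20.13 | 249.3 | 97 | 21.19 | 227.0 | 56 | 10.22 | 13.6 | 3 | 3.67 | 5 | True
3324 | WV | 159 | 415 | 3771164 | No | No | 0 | 169.8 | 114 | 28.87 | 197.7 | 105 | 16.80 | 193.7 | 82 | 8.72 | 11.6 | 4 | 3.13 | 1 | False
3325 | OH | 78 | 408 | 3688555 | No | No | 0 | 193.4 | 99 | 32.88 | 116.9 | 88 | 9.94 | 243.3 | 109 | 10.95 | 9.3 | 4 | 2.51 | 2 | False
3326 | OH | 96 | 415 | 3476812 | No | No | 0 | 106.6 | 128 | 18.12 | 284.8 | 87 | 24.21 | 178.9 | 92 | 8.05 | 14.9 | 7 | 4.02 | 1 | False
3327 | SC | 79 | 415 | 3483830 | No | No | 0 | 134.7 | 98 | 22.90 | 189.7 | 68 | 16.12 | 221.4 | 128 | 9.96 | 11.8 | 5 | 3.19 | 2 | False
3328 | AZ | 192 | 415 | 4144276 | No | Yes | 36 | 156.2 | 77 | 26.55 | 215.5 | 126 | 18.32 | 279.1 | 83 | 12.56 | 9.9 | 6 | 2.67 | 2 | False
3329 | WV | 68 | 415 | 3703271 | No | No | 0 | 231.1 | 57 | 39.29 | 153.4 | 55 | 13.04 | 191.3 | 123 | 8.61 | 9.6 | 4 | 2.59 | 3 | False
3330 | RI | 28 | 510 | 3288230 | No | No | 0 | 180.8 | 109 | 30.74 | 288.8 | 58 | 24.55 | 191.9 | 91 | 8.64 | 14.1 | 6 | 3.81 | 2 | False
3331 | CT | 184 | 510 | 3646381 | Yes | No | 0 | 213.8 | 105 | 36.35 | 159.6 | 84 | 13.57 | 139.2 | 137 | 6.26 | 5.0 | 10 | 1.35 | 2 | False
3332 | TN | 74 | 415 | 4004344 | No | Yes | 25 | 234.4 | 113 | 39.85 | 265.9 | 82 | 22.60 | 241.4 | 77 | 10.86 | 13.7 | 4 | 3.70 | 0 | False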